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I review and expand the model of quantum associative 
memory that I have recently proposed. In this model bi- 
nary patterns of n bits are stored in the quantum superposi- 
tion of the appropriate subset of the computational basis of n 
qbits. Information can be retrieved by performing an input- 
dependent rotation of the memory quantum state within this 
subset and measuring the resulting state. The amplitudes of 
this rotated memory state are peaked on those stored pat- 
terns which are closest in Hamming distance to the input, 
resulting in a high probability of measuring a memory pat- 
tern very similar to it. The accuracy of pattern recall can be 
tuned by adjusting a parameter playing the role of an effec- 
tive temperature. This model solves the well-known capacity 
shortage problem of classical associative memories, providing 
an exponential improvement in capacity. The price to pay 
is the probabilistic nature of information retrieval, a feature 
that, however, this model shares with our own brain. 

I. INTRODUCTION 

The power of quantum computation 0| is mostly as- 
sociated with the speed-up in computing time it can 
provide with respect to its classical counterparts, the 
paramount examples being Shor's factoring algorithm ||] 
and Grover's search algorithm ||]. There is, however, an- 
other aspect of quantum computation which represents a 
big improvement upon its classical counterpart [Q . This 
leads to an exponential increase in a particular memory 
capacity rather than speed. In this paper I will review 
and expand the main aspects of this new application of 
quantum information theory. Further aspects of it can 
be found in 

In traditional computers the storage of information re- 
quires setting up a lookup table (RAM). The main dis- 
advantage of this address-oriented memory system lies in 
its rigidity. Retrieval of information requires a precise 
knowledge of the memory address and, therefore, incom- 
plete or corrupted inputs are not permitted. 

This is definitely not how our own brain works. When 
trying to recognize a person from a blurred photo it is 
totally useless to know that it is the 17384th person you 
met in your life. Rather, the recognition process is based 
on our strong power of association with stored memories 
that resemble the given picture. Association is what we 
use every time we solve a crossword puzzle and is distinc- 
tive of the human brain. 

Given the superior power of associative pattern recog- 



nition for complex tasks, the shortcomings of RAM mem- 
ories were addressed by introducing models of associative 
(or content-addressable) memories Here, recall of in- 
formation is possible on the basis of partial knowledge 
of their content, without knowing the storage location. 
These are examples of collective computation on neural 
networks , the best known example being the Hopfield 
model and its generalization to a bidirectional asso- 
ciative memory . 

While these models solve the problem of recalling in- 
complete or noisy inputs, they suffer from a severe capac- 
ity shortage. Due to the phenomenon of crosstalk, which 
is essentially a manifestation of the spin glass transition 
1^ in the corresponding spin systems, the maximum num- 
ber of binary patterns that can be stored in a Hopfield 
network of n neurons is Pmax — 0.14 n D . While various 
possible improvements can be introduced the maxi- 
mum number of patterns remains linear in the number 
of neurons, pmax = 0{n). 

Quantum mechanics offers a way out from the impos- 
sibility of reconciling the association power of content- 
addressable memories with the requirement of large stor- 
age capacity. Indeed, quantum mechanical entanglement 
provides a natural mechanism for both improving dra- 
matically the storage capacity of associative memories 
and retrieving corrupted or incomplete information. 

The basic idea is to store the given p binary patterns of 
n bits in a quantum superposition of the corresponding 
subset of the computational basis of n qbits. The number 
of binary patterns that can be stored in such a quantum 
associative memory is exponential in the number n of 
qbits, Pmax = 2", i.e. it is optimal in the sense that all 
binary patterns that can be formed with n bits can be 
stored. 

The basic idea of the information retrieval mechanism 
is very simple. Given an input pattern, the memory 
quantum state is rotated within the subspace defined 
by the stored patterns so that the resulting amplitudes 
are peaked on the stored patterns which are closest in 
Hamming distance to the input. A measurement of the 
rotated memory quantum state provides the output pat- 
tern. 

An efficient way to perform this rotation is to embed 
the memory quantum state in a larger Hilbert space by 
adding b control qbits. The full state is then rotated 
in the enlarged Hilbert space. After this rotation one 
is interested only in the projection of the rotated state 
onto a specific subspace of the enlarged Hilbert space. 
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This projection can be obtained either by repeated mea- 
surement or by rotating the state (approximately) to the 
desired subspace using the ampUtude ampHfication tech- 
nique Either way one has to repeat a certain algo- 
rithm a number of times and measure the control regis- 
ter to check if the desired projection has been obtained. 
The information retrieval mechanism is thus probabilis- 
tic, with postselection of the measurement result. This 
means that one has to repeat an algorithm until a thresh- 
old T is reached or the measurement of a control register 
yields a given result. In the former case the input is not 
recognized. In the latter case, instead, the output is de- 
termined itself by a probability distribution on the mem- 
ory which is peaked around the stored patterns closest in 
Hamming distance to the input. 

The accuracy of this information retrieval mecha- 
nism depends on the distribution of the stored patterns. 
Recognition efficiency is best when the number of stored 
patterns is very large while identification efficiency is best 
for isolated patterns which are very different from all 
other ones, both very intuitive features. Both efficiencies 
can be tuned to prescribed accuracy levels. The recog- 
nition efhciency can be varied by changing the threshold 
T: the higher T, the larger the number of qbits that can 
be corrupted without affecting recognition. The identi- 
fication efficiency, instead, can be tuned by varying the 
number b of control qbits in the memory. As we shall see, 
b ^ l/t plays the role of an inverse effective temperature 
t. The lower t, the more concentrated is the correspond- 
ing effective Boltzmann distribution on the states closest 
(in Hamming distance) to the input and the better be- 
comes the identification. 

By averaging over the distribution of stored patterns 
one can eliminate the dependence on the stored pattern 
distribution and derive the effective statistical mechan- 
ics of quantum associative memories by introducing the 
usual thermodynamic potentials. In particular, the free 
energy F{t) describes the average behaviour of the re- 
call mechanism at temperature t and provides concrete 
criteria to tune the accuracy of the associative memory. 
By increasing b (lowering t) , the associative memory un- 
dergoes a phase transition from a disordered phase with 
no correlation between input and output to an ordered 
phase with minimal Hamming distance bewteen the in- 
put and the output. This extends to quantum informa- 
tion theory the relation with Ising spin systems known 
in error-correcting codes and in public key cryptog- 
raphy 0. 



II. STORING INFORMATION 

Let me start by describing the elementary quantum 
gates that I will use in the rest of the paper. First 
of all there are the single-qbit gates represented by the 
Pauli matrices Ui, i = 1...3. The first Pauli matrix 
(Ti, in particular, implements the NOT gate. Another 



single-qbit gate is the Hadamard gate H, with the matrix 
representation 
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Then, I will use extensively the two-qbit XOR (exclusive 
OR) gate, which performs a NOT on the second qbit if 
and only if the first one is in state In matrix notation 
this gate is represented as XOR — diag(l,cri), where 1 
denotes a two-dimensional identity matrix and tJi acts on 
the components |01) and |I1) of the Hilbert space. The 
2X0R, or Toffoli gate is the three qbit generalization of 
the XOR gate: it performs a NOT on the third qbit if and 
only if the first two are both in state |I). In matrix nota- 
tion it is given by 2X0R = diag (1, 1, cri). In the storage 
algorithm I shall make use also of the nXOR generaliza- 
tion of these gates, in which there are n control qbits. 
This gate is also used in the subroutines implementing 
the oracles underlying Grover's algorithm [0 and can be 
realized using unitary maps affecting only few qbits at 
a time [|3|, which makes it feasible. All these are stan- 
dard gates. In addition to them I introduce the two-qbit 
controlled gates 




(2) 



for i = 1, . . . ,p. These have the matrix notation CS^ = 
diag (1,5*). For all these gates I shall indicate by sub- 
scripts the qbits on which they are applied, the control 
qbits coming always first. 

Given p binary patterns of length n, it is not diffi- 
cult to imagine how a quantum memory can store them. 
Indeed, such a memory is naturally provided by the fol- 
lowing superposition of n entangled qbits: 
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(3) 



The only real question is how to generate this state uni- 
tarily from a simple initial state of n qbits. In [Q I pre- 
sented an algorithm which achieves this by loading se- 
quentially the classical patterns into an auxiliary quan- 
tum register from which they are then copied into the 
actual memory register. Here I will generalize this al- 
gorithm by constructing also the unitary operator which 
generates the memory state (||) directly from the state 
|0,...,0). 

Let me begin by reviewing the sequential algorithm of 
ref. 1^. I shall use three registers: a first register p of 
n qbits in which I will subsequently feed the patterns p^ 
to be stored, a utility register u of two qbits prepared in 
state |01), and another register m of n qbits to hold the 
memory. This latter will be initially prepared in state 
|0i, . . . , 0„). The full initial quantum state is thus 
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I^o) = b},...p,^;01;0i,...,0„) 



(4) 



The idea of the storage algorithm is to separate this state 
into two terms, one corresponding to the aheady stored 
patterns, and another ready to process a new pattern. 
These two parts will be distinguished by the state of the 
second utility qbit U2'- |0) for the stored patterns and |1) 
for the processing term. 

For each pattern to be stored one has to perform 
the operations described below: 



ivi> = n 2^oi?^ 



■p\u2mj 



(5) 



This simply copies pattern into the memory register 
of the processing term, identified by \u2) = 



\ipl)= nXOR^^,..m„m\tpl) 



(6) 



The first of these operations makes all qbits of the mem- 
ory register |l)'s when the contents of the pattern and 
memory registers are identical, which is exactly the case 
only for the processing term. Together, these two op- 
erations change the first utility qbit ui of the processing 
term to a |1), leaving it unchanged for the stored patterns 
term. 



1^3 



(7) 



This is the central operation of the storing algorithm. It 
separates out the new pattern to be stored, already with 
the correct normalization factor. 



IV'5 



nXORr, 



(8) 



These two operations are the inverse of eqs.(g) and re- 
store the utility qbit ui and the memory register m to 
their original values. After these operations on has 
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described. At the end of the whole process, the m-register 
is exactly in state \m), eq. (||). 

Any quantum state can be generically obtained by a 
unitary transformation of the initial state |0, . . . , 0). This 
is true also for the memory state \m). In the following 
I will explicitly construct the unitary operator M which 
implements the transformation |m) = M |0, . . . , 0). 

To this end I introduce first the single-qbit unitary 
gates 



Uj = cos 
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where a2 is the second Pauli matrix. These operators 
are such that their product over the n qbits generates 
pattern out of |0, . . . , 0): 



\f)^P'^ |0,...,0) , 

n 



(12) 



I now introduce, in addition to the memory register 
proper, the same two utility qbits as before, also initially 
in the state |0). The idea is, exactly as in the sequential 
algorithm, to split the state into two parts, a storage term 
with 1^2) = |0) and a processing term with \u2) = |1)- 
Therefore I generalize the operators P* defined above to 



n cuu2, , 



(13) 



which loads pattern into the memory register only for 
the processing term. It is then easy to check that 

|m;00) =M |0,...,0;00) , 

^ - n [{CPiy'NOT^,CSZtl-^XOR^,^,CPl 



X NOT^, 



(14) 



The storage algorithm is thus efficient in the sense that 
the number p(2n -I- 3) -I- 1 of elementary one- and two-qbit 
gates needed to implement it, is linear in n for fixed p. 
Note that, by construction, there are no restrictions on 
the loading factor p/n. However, the storage algorithm 
is efficient in an absolute sense only for p polynomial in 
n. 



With the last operation, 



III. REMEMBERING 



1^7) = n ^xoRp. 



U2mj 



j=n 



(10) 



one restores the third register m of the processing term, 
the second term in eq.(^) above, to its initial value 
|0i, . . . ,0„). At this point one can load a new pattern 
into register p and go through the same routine as just 



A memory is of real value only if it can be used re- 
peatedly. This poses a problem since, as we shall see in 
the next section, an output of the memory is obtained 
by measuring the memory register and the rules of quan- 
tum mechanics imply that, when the memory register is 
measured, all the information about the entangled super- 
position of stored patterns is lost. If one does not want 
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to forget everything after the first information retrieval 
one must therefore find a way to store the information 
for repeated use. 

In quantum mechanics there are many choices to do 
this, since information can be stored, with various de- 
grees of compression, both in quantum states and in uni- 
tary operators. The most compressed storage would be 
a quantum state: in this case up to 2" patterns can 
be stored using only n (quantum) degrees of freedom. 
To this end, however, one would have to keep a master 
copy of the memory and produce copies out of it when 
needed. Unfortunately, this is impossible since the linear- 
ity of quantum mechanics forbids exact universal cloning 
of quantum states [0. Universal cloning [|l5| has two 
disadvantages: first of all the copies to be used for in- 
formation retrieval are imperfect, though optimal [ p^ ; 
secondly, the quality of the master copy decreases with 
each recall, i.e. the memory is quickly washed out. 

This leaves state-dependent cloning as the only viable 
option if one wants to store at least part of the infor- 
mation in a quantum state. State-dependent doners are 
designed to reproduce only a finite number of states and 
this is definitely enough for our purposes. The simplest 
option in this setting is to use a probabilistic cloning ma- 
chine [|r^. To this end it is sufficient to consider any 
dummy state \d) different from \m) (for more than two 
states the condition would be linear independence) and 
to construct a probabilistic cloning machine for these two 
states. This machine reproduces |m) with probability pm 
and \d) with probability pd', a flag tells us exactly when 
the desired state |m) has been obtained. In order to ob- 
tain an exact copy of \m) one needs then 1/pm trials on 
average. The master copy is exactly preserved. 

The cloning efficiencies of the probabilistic doner of 
two states are bounded as follows llTtl: 



Pm+Pd < 



1 + {d\7 



(15) 



This bound can be made large by choosing \d) as nearly 
orthogonal to |m) as possible. A simple way to achieve 
this for a large number of patterns is to encode also the 
state 



^ i:(-i)-«i.-> 



(16) 



together with |m) when storing information. This can 
be done simply by using alternately the operators S"* and 
(S'*) in the storing algorithm of section 2. For binary 
patterns which are all different from one another one has 
then 



(d\m)^ , 

{d\m)= - , 
P 



p even 
p odd 



(17) 



and the bound for the cloning efficiencies is very close to 
its maximal value 2 in both cases. 



The quantum network for the probabilistic doner of 
two states has been developed in It can be con- 

structed exclusively out of the two simple distinguishabil- 
ity tranfer (D) and state separation (S) gates. Note that 
these gates embody information about the two states to 
be cloned. Part of the memory, therefore, actually re- 
sides in the cloning network, which is unavoidable if one 
wants to use a quantum memory repeatedly. This is then 
a mixed solution in which part of the information resides 
in a quantum state and another part in a unitary opera- 
tor implemented as a probabilistic cloning network. 

At the other end of the spectrum one can store the 
information about the p patterns entirely in the unitary 
operator M in eq.(|lj). Each time one needs to retrieve 
information one prepares then an initial quantum state 
|0, . . . , 0) and one transforms it into |m) by applying AI 
to it. In this case one needs pn bits of information to 
store the p patterns, exactly as in the classical Hopfield 
model. Indeed, this way of storing the patterns is very 
close in spirit to the Hopfield model since a unitary op- 
erator can always be represented as the exponential of a 
Hamiltonian operator, which is the quantum generaliza- 
tion of an energy functional. As I now show, however, in 
the quantum case there are no restrictions on the number 
of patterns that can be stored and retrieved. 



IV. RETRIEVING INFORMATION 

Assume now one is given a binary input i, which might 
be, e.g. a corrupted version of one of the patterns stored 
in the memory. The retrieval algorithm requires also 
three registers. The first register i of n qbits contains 
the input pattern; the second register m, also of n qbits, 
contains the memory |m); finally there is a control regis- 
ter c with b qbits all initialized in the state |0). 

The full initial quantum state is thus: 



1 ^ 

IV'o) = — ^|z;/;Oi,...,0, 

k=i 



(18) 



where \i) — . . . ,i„) denotes the input qbits, the sec- 
ond register, to, contains the memory (^ and all b control 
qbits are in state |0). Applying the Hadamard gate to 
the first control qbit one obtains 

1 

l^i)^^ VN;p^0i,...,06) 



fc=i 
p 



]= ^|z;/;li,...,Ofc 



(19) 



fc=i 



I now apply to this state the following combination of 
quantum gates: 

n 

1^2) = n ^OR,^m, 1^1) , (20) 
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As a result of the above operation the memory register 
qbits are in state |1) if ij and pj are identical and |0) 
otherwise: 

1 ^ 



p f) 



fc=i 
p 



fc— 1 



(21) 



where d*^ = 1 if and only if ij = pj and d'^ — otherwise. 



Consider now the following Hamiltonian: 



y-v / (73 + 1 



(22) 



where (T3 is the third Pauli matrix. Ti measures the num- 
ber of O's in register to, with a plus sign if ci is in state 
|0) and a minus sign if ci is in state |1). Given how I have 
prepared the state \ip2), this is nothing else than the num- 
ber of qbits which are different in the input and memory 
registers i and to. This quantity is called the Hamming 
distance and represents the (squared) Euclidean distance 
between two binary patterns. 

Every term in the superposition ( pT]) is an eigenstate of 
Ti with a different eigenvalue. Applying thus the unitary 
operator exp(i7r7i/2ri) to \tp2) one obtains 



1^3)= ^ E e^^''-^^'^') N; d^ Oi, . . . , 0,) 



(23) 



fc=i 
p 



fc=i 

where dn {i,p'') denotes the Hamming distance bewteen 
the input i and the stored pattern p'^ . 

In the final step I restore the memory gate to the state 
I to) by applying the inverse transformation to eq. ( |20| ) 
and I apply the Hadamard gate to the control qbit ci, 
thereby obtaining 

1 

1^4)= He, n XOR,^„^, NOT,n, IV-s) , (24) 

1 ^ 

—^cos ^dn (^,/) |i;p'=;0i,...,06) 

VP k=i 

1 ^ TT 

-F— ^sin ^dn {i,p^) |i;p'=;li,...,06). 

The idea is now to repeat the above operations sequen- 
tially for all h control qbits c\ to c^. This gives 



IV'fin) 



Sin' (l^dH 



EN;p';^')' (25) 
{J'} 



where {J^'} denotes the set of all binary numbers of h 
bits with exactly I bits 1 and (6 — V) bits 0. 

As in the case of the storing algorithm, there is a ver- 
sion of the information retrieval algorithm in which the 
input is not loaded into an auxiliary quantum register 
but rather into a unitary operator. Indeed, the auxil- 
iary quantum register is needed only by the operator ( [20| ) 
leading from (|l9|) to (pl|). The same result (apart from 
an irrelevant overall sign) can be obtained by applying 



i= n ' 

Vi= sin 



1 + « cos 



02 



(26) 



directly on the memory state |to). The rest of the al- 
gorithm is the same, apart the reversing of the operator 
(P0| ) which needs now the operator . 

The end effect of the information retrieval algorithm 
represents a rotation of the memory quantum state in 
the enlarged Hilbert space obtained by adding h control 
qbits. Note that the overall effect of this rotation is an 
overall amplitude concentration on memory states simi- 
lar to the input if there is a large number of |0) control 
qbits and an amplitude concentration on states different 
to the input if there is a large number of |1) control qbits. 
As a consequence, the most interesting state for informa- 
tion retrieval purposes is the projection of iV'fin) onto the 
subspace with all control qbits in state |0). 

There are two ways of obtaining this projection. The 
first is to repeat the above algorithm and measure the 
control register several times, until exactly the desired 
state for the control register is obtained. If the number 
of such repetitions exceeds a preset threshold T the in- 
put is classified as "non-recognized" and the algorithm 
is stopped. Otherwise, once |ci, . . . , Cfc) ~ |0i, . . . , Ob) is 
obtained, one proceeds to a measurement of the mem- 
ory register to, which yields the output pattern of the 
memory. 

The second method is to apply T steps of the ampli- 
tude amplification algorithm JTo| rotating |'(/'fin) towards 
its projection onto the "good" subspace formed by the 
states with all control qbits in state |0). To this end it is 
best to use the versions of the storing and retrieving algo- 
rithms which do not need any auxiliary quantum register 
for inputs. Let me define as the input-dependent 
operator which rotates the memory state in the Hilbert 
space enlarged by the h control qbits towards the final 
state I ■(/'fin) in eq. (Ea) (where I now omit the auxiliary 
register for the input): 



1^. 



'fin; 



R(i) |TO;0i,...,0b) 



(27) 
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By adding also the two utility qbits needed for the storing 
algorithm one can then obtain IV'fin) as a unitary trans- 
formation of the initial state with all qbits in state |0): 

IV'fin; 00) = R{i)M |0, . . . , 0; Oi, . . . , 0,; 00) . (28) 

The amplitude amplification rotation of l^'fin! 00) towards 
its "good" subspace in which all b control qbits are in 
state |0) is then obtained jl^ by repeated application of 
the operator 

Q = -R{i)MSQM-^R-^{i)S (29) 

on the state jV'finiOO). Here S conditionally changes the 
sign of the amplitude of the "good" states with the h 
control qbits in state |0), while Sq changes the sign of 
the amplitude if and only if the state is the zero state 
|0, . . . , 0; Oi, . . . , Ob; 00). As before, if a measurement of 
the control register after the T iterations of the amplitude 
amplification rotation yields |0i, . . . , Ob) one proceeds to a 
measurement of the memory register; otherwise the input 
is classified as " non- recognized" . 

Since the expected number of repetitions needed to 
measure the desired control register state is l/P^*"^, with 

Pr^- E cos^^rfdnO;/)) (30) 

the probability of measuring |ci, . . . , c„) = |0i, . . . , 0„), 
the threshold T governs the recognition efficiency of the 
input patterns. Note, however, that amplitude amplifi- 
cation provides a quadratic boost [|lO| to the recognition 
efficiency since only 1/ ^/P^ steps are required to rotate 
IV'fin) onto the desired subspace. Accordingly, the thresh- 
old T can be lowered to -s/T with respect to the method 
of projection by measurement. 

Once the input pattern i is recognized, the measure- 
ment of the memory register yields the stored pattern 
with probability 

Pb(/)=|cos2''(|-d^(z,p'=)) , (31) 

Z=pPr = Ecos2^(|^dif(*,p')) ■ (32) 
fe=i 

Clearly, this probability is peaked around those patterns 
which have the smallest Hamming distance to the input. 
The highest probability of retrieval is thus realized for 
that pattern which is most similar to the input. This 
is always true, independently of the number of stored 
patterns. In particular there are never spurious memo- 
ries: the probability of obtaining as output a non-stored 
pattern is always zero. As a consequence there are no 
restrictions on the loading factor p/n coming from the 
information retrieval algorithm. 

In addition to the threshold T, there is a second tun- 
able parameter, namely the number b of control qbits. 



This new parameter b controls the identification effi- 
ciency of the quantum memory since, increasing 6, the 
probability distribution Pb (p'^) becomes more and more 
peaked on the low (i,p'^) states, until 

lim Pb - 4fc„.„ , (33) 

where k^in is the index of the pattern (assumed unique 
for convenience) with the smallest Hamming distance to 
the input. 

The probability of recognition is determined by com- 
paring (even) powers of cosines and sines of the distances 
to the stored patterns. It is thus clear that the worst 
case for recognition is the situation in which there is an 
isolated pattern, with the remaining patterns forming a 
tight cluster spanning all the largest distances to the first 
one. As a consequence, the threshold needed to recog- 
nize all patterns diminishes when the number of stored 
patterns becomes very large, since, in this case, the dis- 
tribution of patterns becomes necessarily more homoge- 
neous. Indeed, for the maximal number of stored pat- 
terns p = 2" one has P™'^ = 1/2^ and the recognition 
efficiency becomes also maximal, as it should be. 

While the recognition efficiency depends on compar- 
ing powers of cosines and sines of the same distances in 
the distribution, the identification efficiency depends on 
comparing the (even) powers of cosines of the different 
distances in the distribution. Specifically, it is best when 
one of the distances is zero, while all others are as large 
as possible, such that the probability of retrieval is com- 
pletely peaked on one pattern. As a consequence, the 
identification efficiency is best when the recognition effi- 
ciency is worst and viceversa. 

The role of the parameter b becomes familiar upon 
a closer examination of eq.( |3l| ). Indeed, the quantum 
distribution described by this equation is equivalent to 
a canonical Boltzmann distribution with (dimensionless) 
temperature t — 1/b and (dimensionless) energy levels 

i?^ = -2 log cos (*,/)) , (34) 

with Z playing the role of the partition function. 

The appearance of an effective thermal distribution 
suggests studying the average behaviour of quantum 
associative memories via the corresponding thermody- 
namic potentials. Before this can be done, however, one 
must deal with the different distributions of stored pat- 
terns characterizing each individual memory. To this end 
I propose to average also over this distribution, by keep- 
ing as a tunable parameter only the minimal Hamming 
distance d between the input and the stored patterns. In 
doing so, one obtains an average description of the av- 
erage memory. This is essentially the replica trick used 
to derive the behaviour of spin glasses and classical 
Hopfield models 

As a first step it is useful to normalize the pattern rep- 
resentation by adding (modulo 2) to all patterns, input 
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included, the input pattern i. This clearly preserves all 
Hamming distances and has the effect of normalizing the 
input to be the state with all qbits in state |0). The 
Hamming distance dn {hp'') becomes thus simply the 
number of qbits in pattern p*^ with value For loading 
factors p/n — > in the limit n ~~^ oo the partition func- 
tion for the average memory takes then a particularly 
simple form: 



P 



EE A. 

{A} j=d 



„2b 



2 n 



(35) 



where Xj describes an unconstrained probability distribu- 
tion such that X]J=d = 1; {^} is the set of such distri- 
butions and A^A the corresponding normalization factor. 
For finite loading factors, instead, the probabilities Xj 
become subject to constraints which make things more 
complicated. 

I now introduce the free energy F{b, d) by the usual 
definition 



(36) 



where I have chosen a normalization such that exp(— &_F) 
describes the deviation of the partition function from 
its value for 6 = (high effective temperature). Since 
Z/p, and consequently also Z^^jp posses a finite, non- 
vanishing large-n limit, this normalization ensures that 
F{b,d) is intensive, exactly like the energy levels (34), 
and scales as a constant for large n. This is the only dif- 
ference with respect to the familiar situation in statistical 
mechanics. 

The free energy describes the equilibrium of the sys- 
tem at effective temperature t = 1/b and has the usual 
expression in terms of the internal energy U and the en- 
tropy S: 



F{t, d)^ U{t, d)-tS{t,d) 
U{t,d)={E)t, S{t,d) = 



-dFjt, d) 
dt 



(37) 



Note that, with the normalization I have chosen in (36), 
the entropy S is always a negative quantity describing 
the deviation from its maximal value Smax = at t = oo. 

By inverting eq.(|3j) with F substituting E one can 
also define an effective (relative) input/output Hamming 
distance V at temperature t: 



2 ~F(t,d) 

I?(i, d) — — arccos e ^ 

TT 



(38) 



This corresponds exactly to representing the recognition 
probability of the average memory as 



{pr).. = cos''[^vib,d)) , 



(39) 



which can also be taken as the primary definition of the 
effective Hamming distance. 



The function X>(6, d) provides a complete description 
of the behaviour of quantum associative memories in the 
limit p/n <^ 1. This can be used to tune their perfor- 
mance. Indeed, suppose that one wants the memory to 
recognize and identify inputs with up to en corrupted in- 
puts with an efhciency of (0 < < 1). Then one must 
choose a number b of control qbits sufficiently large that 
(!?(&, en) — e) < (1 — J^) and a threshold T of repetitions 
satisfying T > 1/cos^'' (|-I?(6, en)), as illustrated in Fig. 
1 below. 

A first hint about the general behaviour of the effective 
distance function I?(6, d) can be obtained by examining 
closer the energy eigenvalues (^4|). For small Hamming 
distance to the input these reduce to 



1h [t,p 



n 



< 1 



(40) 



Choosing again the normalization in which \i) — |0 . . . 0) 
and introducing a "spin" with value — —1/2 if qbit 
i in pattern p'^ has value |0) and s*^ = -1-1/2 if qbit i 
in pattern p'^ has value one can express the energy 
levels for d^ /n 1 as 



16 + 4n2 2^ 

I'd 



An ^ ' 



(41) 



Apart from a constant, this is the Hamiltonian of an 
infinite-range antifcrromagnetic Ising model in presence 
of a magnetic field. The antiferromagnetic term favours 
configurations k with half the spins up and half down, so 
that Stot = J^i^i = 0' giving E'' = 7r2/16.The mag- 
netic field, however, tends to align the spins so that 
sJqj — ~n/2, giving E'^ — 0. Since this is lower than 
71^/16, the ground state configuration is ferromagnetic, 
with all qbits having value |0). At very low tempera- 
ture (high 6), where the energy term dominates the free 
energy, one expects thus an ordered phase of the quan- 
tum associative memory with Vit, d) = d/n. This cor- 
responds to a perfect identification of the presented in- 
put. As the temperature is raised {b decreased) however, 
the thermal energy embodied by the entropy term in the 
free energy begins to counteract the magnetic field. At 
very high temperatures (low b) the entropy approaches 
its maximal value S{t — oo) = (with the normalization 
chosen here). If this value is approached faster than 1/t, 
the free energy will again be dominated by the internal 
energy . In this case, however, this is not any more deter- 
mined by the ground state but rather equally distributed 
on all possible states, giving 



F{t = oo)= U{t = oo) 



T / dx 2 log cos ( —X 

^ V2 



and leading to an effective distance 



(42) 
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This value corresponds to a disordered phase with no 
correlation between input and output of the memory. 

A numerical study of the thermodynamic potentials in 
( ^ ) and ( psf ) indeed confirms a phase transition from the 
ordered to the disordered phase as the effective temper- 
ature is raised. In Fig. 1 I show the effective distance 
V and the entropy 5* for 1 Mb (n = 8 x 10^) patterns 
and d/n = 1% as a function of the inverse temperature 
h (the entropy is rescaled to the interval [0,1] for ease of 
presentation). At high temperature there is indeed a dis- 
ordered phase with S = 5,nax = and V — 2/?,. At low 
temperatures, instead, one is in the ordered phase with 
S = S'min and T) = d/n = 0.01. The effective Hamming 
distance plays thus the role of the order parameter for 
the phase transition. 



Order-Disorder Transition of QAiVIs 

i""i ' "i""i ' "i""i ' "i""i ' "i""i ' "i""i ' "i""i ' "i""i ' "i""i ' "i""i ' "I' 




Number b of control qbits 

FIG. 1. Efltective input/output distance and entropy 
(rescaled to [0,1]) for 1Mb patterns and d/n — 1%. 

The phase transition occurs at b„ ~ 10^^. The physi- 
cal regime of the quantum associative memory (6 = posi- 
tive integer) begins thus just above this transition. For a 
good accuracy of pattern recognition one should choose 
a temperature low enough to be well into the ordered 
phase. 

Having described at length the information retrieval 
mechanism for complete, but possibly corrupted pat- 
terns, it is easy to incorporate also incomplete ones. 
To this end assume that only q < n qbits of the in- 
put are known and let me denote these by the indices 
{kl, . . . ,kq}. After assigning the remaining qbits ran- 
domly, there are two possibilities. One can just treat the 
resulting complete input as a noisy one and proceed as 
above or, better, one can limit the operator (dn)^ in the 



Hamiltonian ( p^ ) to 

(dH)^ - i (^) , (44) 
i=i \ ^ '"fci 

so that the Hamming distances to the stored patterns are 
computed on the basis of the known qbits only. After this 
the pattern recall process continues exactly as described 
above. This second possibility has the advantage that it 
does not introduce random noise in the similarity mea- 
sure but it has the disadvantage that the operations of 
the memory have to be adjusted to the inputs. 

V. EFFICIENCY, COMPLEXITY AND MEMORY 
TUNING 

As anticipated in section 4, the effective i/o Hamming 
distance can be used to tune the quantum associative 
memory to prescribed accuracy levels. Typically, it is to 
be expected that increasing this accuracy will lead to an 
enhaiitswi complexity level. Before I even begin address- 
ing this issue, however, I will show that the information 
retrieval algorithm is efficient. 

First of all I would like to point out that, in addi- 
tion to the standard NOT, H (Hadamard), XOR, 2X0R 
(Toffoli) and nXOR gates M_^l have introduced only the 
two-qbit gates CS'' in eq. ^) and the unitary operator 
exp (z7r7i/2n). This latter can, however also be realized 
by simple gates involving only one or two qbits. To this 
end I introduce the single-qbit gate 

and the two-qbit controlled gate 

CU-^ ^\0){0\(g)l + \l){l\(g>U-^ . (46) 

It is then easy to check that exp (i7r7i/2n) in eq. ([2ll ) 
can be realized as follows: 

n n 

e^^« 1^2) = n (CC^"')c™. n t^™. 1^2) , (47) 

i=l 3=1 

where c is the control qbit for which one is currently re- 
peating the algorithm. Essentially, this means that one 
implements first exp (i7r(i_f//2n) and then one corrects by 
implementing exp (—indH /n) on that part of the quan- 
tum state for which the control qbit |c) is in state |1). 

Using this representation for the Hamming distance 
operator one can count the total number of simple gates 
that one must apply in order to implement one step of the 
information retrieval algorithm. This is given by (6n + 2) 
using the auxiliary register for the input and by (4n -|- 2) 
otherwise. This retrieval step has then to be repeated 
for each of the b control qbits. Therefore, implement- 
ing the projection by repeated measurements, the overall 
complexity C of information retrieval is bounded by 



8 



C < Tb{6n + 2)Ccion , 



(48) 



where Cdon is the complexity of the (probabiUstic) 
cloning machine that prepares a copy of the memory 
state. 

The computation of the complexity is easier for the 
information retrieval algorithm which uses the amplitude 
amplification technique. In this case the initial memory 
is prepared only once by a product of the operators M, 
with complexity p{2n + 3) + 1 and R(i), with complexity 
6(471 + 2). Then one applies T times the operator Q, with 
complexity p(4n + 6) + 5(8n + 4) + 2 + C5 + Cs(,, where Cs 
and Csq are the polynomial complexities of the oracles 
implementing S and Sq. This gives 

C= T [p(4n + 6) + b(8n + 4) + 2 + Cs + Cs,] + 
+p{2n + 3) + b{An + 2) + 1 . (49) 

As expected, this result depends on both T and b, the pa- 
rameters governing the recognition and identification effi- 
ciencies. This represents exactly the unavoidable tradeoff 
between accuracy and complexity. 

Suppose now one would like to recognize on average 
inputs with up to 1% of corrupted or missing bits and 
identify them with high accuracy. The effective i/o Ham- 
ming distance V shown in Fig. 1 can then be used to 
determine the values of the required parameters T and 
b needed to reach this accuracy for the average memory 
with p/n < 1. For b = 10'' e.g., one has V = 0.018, 
which gives the average i/o distance (in percent of total 
qbits) if the minimum possible i/o distance is 0.01. For 
this value of 6 the recognition probability is 1.5 10""*. 
With the measurement repetition technique one should 
thus set the threshold T — 0.6 10'*. Using amplitude 
amplification, however, one needs only around T = 80 
repetitions. 

I would like to conclude by stressing that the values 
of b and T obtained by tuning the memory with the 
effective i/o Hamming distance become n-independent 
for large values of n. This is because they are inten- 
sive variables unaffected by this "thermodynamic limit". 
For any p polynomial in n the information retrieval can 
then be implemented efficiently and the overall complex- 
ity is determined by the accuracy requirements via the 
n-independent parameters T and b. 



VI. CONCLUSION 

I would like to conclude this review by stressing the 
reason why a quantum associative memory works better 
than its classical counterpart. 

In classical associative memories, the information 
about the patterns to recall is typically stored in an en- 
ergy functional. When retrieving information, the input 
configuration evolves to the corresponding output, driven 
by the memory functional. The capacity shortage is due 
to a phase transition in the statistical ensemble governed 



by the memory energy functional. Spurious memories, 
i.e. spurious metastable minima not associated with any 
of the original patterns become important for loading 
factors p/n > 0.14 and wash out completely the memory. 
So, in the low p/n phase the memory works perfectly in 
the sense that it outputs always the stored pattern which 
is most similar to the input. For p/n > 0.14, instead, 
there is an abrupt transition to total amnesia caused by 
spurious memories. 

Quantum associative memories work better than clas- 
sical ones since they are free from spurious memories. 
The easiest way to see this is in the formulation 



|to) = M |0) . 



(50) 



All the information about the stored patterns is encoded 
in the unitary operator M. This is such that all quantum 
states that do not correspond to stored patterns have 
exactly vanishing amplitudes. 

An analogy with the classical Hopfield model |^ can 
be established as follows. Instead of generating the mem- 
ory state from the initial zero state one can start from a 
uniform superposition of the computational basis. This 
is achieved by the operator MW defined by 



2"-l 



|to)= MW - 

n 



(51) 



Now, the same result can also be obtained by Grover's 
algorithm, or better by its generalization with zero failure 
rate Here the state |m) is obtained by applying to 
the uniform superposition of the computational basis q 
times the search operator X defined by 



3=0 



X= ^WJoWJ 



(52) 



where J rotates the amplitudes of the states correspond- 
ing to the patterns to be stored by a phase (j) which is 
very close to tt (the original Grover value) for large n and 
Jq does the same on the zero state. Via the two equa- 
tions dH) and (11), the memory operator M provides an 
implicit realization of the phase shift operator J. Being 
a unitary operator, this can always be written as an ex- 
ponential of an hcrmitian Hamiltonian 7i, which is the 
quantum generalization of a classical energy functional. 
By defining J = exp(— iTY) one obtains an energy oper- 
ator which is diagonal in the computational basis. The 
energy eigenvalues of this operator are such that the pat- 
terns to be stored have energy E = ~(j> ~ —tt while all 
others have energy E = 0. This formulation is the exact 
quantum generalization of the Hopfield model; the im- 
portant point is that the operator M realizes efficiently a 
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dynamics in which the patterns to be stored are always, 
for all numbers p of patterns, the exact global minima 
of a quantum energy landscape, without the appearance 
of any spurious memories. The price to pay is the prob- 
abilistic nature of the information retrieval mechanism. 
As always in quantum mechanics, the dynamics deter- 
mines only the evolution of probability distributions and 
the probabilistic aspect is brought in by the collapse of 
this probability distributions upon measurement. There- 
fore, contrary to the classical Hopfield model in the low 
p/n phase, one does not always have the absolute guar- 
antee that an input is recognized, and identified correctly 
as the stored pattern most similar to the input, even if 
this state has the highest probability of being measured. 
But, after all, the same happens also to the human brain. 
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